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Abstract 

The multiuser communication channel, in which multiple users exchange information with the 
help of a relay terminal, termed the multi-way relay channel (mRC), is introduced. In this model, 
multiple interfering clusters of users communicate simultaneously, where the users within the same 
cluster wish to exchange messages among themselves. It is assumed that the users cannot receive each 
other's signals directly, and hence the relay terminal in this model is the enabler of communication. In 
particular, restricted encoders, which ignore the received channel output and use only the corresponding 
messages for generating the channel input, are considered. Achievable rate regions and an outer bound 
are characterized for the Gaussian mRC, and their comparison is presented in terms of exchange rates 
in a symmetric Gaussian network scenario. It is shown that the compress-and-forward (CF) protocol 
achieves exchange rates within a constant bit offset of the exchange capacity independent of the power 
constraints of the terminals in the network. A finite bit gap between the exchange rates achieved by 
the CF and the amplify-and-forward (AF) protocols is also shown. The two special cases of the mRC, 
the full data exchange model, in which every user wants to receive messages of all other users, and the 
pairwise data exchange model which consists of multiple two-way relay channels, are investigated in 
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detail. In particular for the pairwise data exchange model, in addition to the proposed random coding 
based achievable schemes, a nested lattice coding based scheme is also presented and is shown to 
achieve exchange rates within a constant bit gap of the exchange capacity. 

I. Introduction 

Relay terminals in wireless networks are instrumental in providing robustness against channel 
variations, extending coverage in the case of power limited terminals, and in improving energy 
efficiency. The three-terminal relay channel [1], one of the earliest models in network information 
theory, serves as a main building block for large wireless networks. More recently, it has been 
recognized that effective relaying protocols can be devised to facilitate cooperation between two 
users when they want to exchange information simultaneously over a single relay terminal. In 
this paper we introduce a new fundamental building block for general multicast communication. 
The model, termed the multi-way relay channel (mRC), considers multiple clusters of users such 
that the users in each cluster want to exchange information among themselves. This exchange 
is facilitated by a relay terminal that helps all the users in the system. We consider a total of N 
users grouped into L > 1 clusters of K > 2 distinct users each, i.e., iV = KL. 

This setup is general enough to model a variety of communication scenarios. Consider, for 
example, a peer-to-peer wireless network with groups of users sharing data with the help of a 
relay node. Here, the users who are interested in the same file can be grouped into clusters. Each 
user in the cluster might have a portion of the file that is required by the other users in the cluster. 
Many such clusters need to be served simultaneously by the relay terminal. Similarly, in a social 
network scenario, the clusters may be formed based on the connections among the users, and the 
users in each cluster, or a friend group, might want to exchange their personal information through 
the relay terminal (see Fig. [Qfor an illustration). In a sensor network scenario, clusters may be 
formed based on the physical phenomenon that the sensors are measuring, i.e., temperature 
sensors exchange local temperature among themselves while pressure sensors exchange local 
pressure measurements. As yet another example, consider multiple terrestrial (ad-hoc) networks 
with nodes geographically distributed and served by a single communication satellite. The nodes 
in each network may want to exchange available local information (e.g. control information) 
among all the network nodes. 

Our focus for this new model will be on the Gaussian channel. In particular, we have an 

DRAFT 



3 




Fig. 1. An illustration of the multi-way relay channel model in which the relay terminal helps two separate clusters of users 
in a social network to simultaneously exchange messages. 

additive Gaussian multiple access channel (MAC) from the users to the relay terminal, and 
a Gaussian broadcast channel from the relay to the users. It is assumed that the users do not 
receive each other's signals. This can be due to physical restrictions among the nodes in the sensor 
network scenario, due to large physical distances among the terminals in the satellite scenario, or 
due to the network protocol since infrastructure-based systems tend to have better performance 
and lower latency than ad-hoc ones. We study the most fundamental coding techniques that have 
been introduced in the literature for relay networks. In particular, we derive the achievable rate 
regions for the corresponding multi-way extensions of decode-and-forward (DF), amplify-and- 
forward (AF) and compress-and-forward (CF) protocols. 

In the DF scheme, the relay node is enforced to decode all the messages in the network. Since 
relay node is not originally a sink node, the decoding at the relay is not imposed by the problem, 
and hence, can be limiting with the increasing number of users in the system. In the CF scheme 
the relay quantizes its observation and broadcasts the quantized version to all the receivers. This 
circumvents the decoding requirement at the relay, but it suffers from the fact that the noise at 
the relay terminal is also quantized and forwarded to the users. 
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In all these achievable schemes, we exploit the fact that each user already knows its own 
message and can use this information for more effective decoding. While for the AF scheme, 
this can be achieved by each user simply subtracting its own transmit signal from the signal it 
receives; in the case of DF and CF schemes, this requires using a more involved coding scheme 
introduced in [2] and 0, respectively, for lossless and lossy broadcasting of a common source 
to receivers with correlated side information. 

To provide a performance comparison of the proposed coding schemes, we focus on the 
achievable symmetric rate, termed the exchange rate, for a symmetric network setup. We define 
the total exchange rate as the total rate of data that can be transmitted over the system while 
each user's message has the same rate. The supremum of achievable total exchange rates is 
called the exchange capacity. The investigation of the exchange capacity allows us to obtain 
simple explicit rate expressions, and acquire fundamental insights into the behavior of the 
communication protocols in consideration. We characterize analytically the exchange capacity 
upper bound and the total exchange rates achievable by AF, DF and CF schemes. 

We investigate two special cases of the mRC in detail: the full data exchange model in which 
each user wants to learn all the messages in the network, and the pairwise data exchange model 
in which multiple user pairs exchange information. It is shown in [4] that the CF scheme achieves 
within a half bit of the capacity for the symmetric TRC, while DF achieves the capacity when 
the additional sum-rate constraint is not the bottleneck. Similarly, nested lattice codes [|5l are 
shown in to achieve rates within a half bit of the capacity in a TRC. Here, we show that 
similar finite-bit approximation results for the exchange capacity can be obtained in the more 
general model of the mRC as well. We show that the CF scheme achieves total exchange rates 
within a finite-bit gap of the exchange capacity for any number of clusters and users. This limited 
rate loss is due to noise forwarding from the relay terminal to the users; however, its negative 
effect becomes less important as the number of users in each cluster increases since the relative 
strength of the noise variance diminishes. 

We also extend the nested lattice coding scheme to the pairwise data exchange model with 
multiple clusters, and show that employing nested lattice codes yields total exchange rates within 
a finite-bit gap of the total exchange capacity for any number of clusters. Using structured codes 
allows the relay to decode only a function of the users' messages rather than decoding each one 
of them, which fits well with the data exchange model considered here. 
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The study of multi-way channels for data exchange dates back to Shannon's work on two- 
way channels 0. In @, a multiuser extension of the two-way channel model is studied. These 
models do not include a relay terminal. The two-way relay channel (TRC), also known as the 
bidirectional relay channel, models the relay network with two-users exchanging information 
over a relay terminal. TRC has received considerable attention recently, see flU, J6]|, J9l, ifTOll . 
ifTTfl. |[T2l|. [fT3l . |fl4| and the references therein. The mRC model generalizes the TRC model; 
in the special case of L = 1 and K = 2, this model reduces to the TRC. 

In [fT51 . multiple simultaneous data transmissions over a relay network is studied with joint 
network and superposition coding. In an independent work closely related to ours lfT6ll . Cui et al. 
consider in particular the full data exchange model, and study AF, DF and CF schemes. Following 
our initial study [17], Ong et al. also studied the full date exchange model and characterized the 
exact capacity region for finite field channels |[T8l . The pairwise data exchange model is studied 
from the perspective of optimal power allocation for the special case with orthogonal channels 
in |fl9l and bit error rate analysis for interference limited scenarios in ll20ll . The multi-pair TRC, 
which corresponds to L clusters with K = 2, is also studied in [|2T]|. 

The following notation and definitions will be used throughout the paper. We denote the set 
{1, . . . , K} by X K for a positive integer K. For I e Z K , we have X K \l = {1, — 1, 1 + 
1, . . . , K}. We denote the sequence (Xl, . . . , X n ) by X™. We use C for a proposer subset, i.e., 
A\B is nonempty for any set B C A, while C is used for any subset. We define the function 
[x] + = max{0, x}. We also define the function C(x) for a non-negative real number x as 

C(x)4llog(l + x). 

The rest of the paper is organized as follows. The system model is introduced in Section HH 
A cut-set outer bound and inner bounds achievable by AF, DF and CF schemes are presented 
in Section [III] for the general model. Section [IV] focuses on the achievable exchange rate for a 
symmetric Gaussian network. Two special types of networks, the full data exchange and pairwise 
data exchange models, are studied in Section |V] and numerical results are provided for these 
cases. We close the paper with some conclusions highlighting our main results and the insights 
they provide. 
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II. System Model 

We consider a Gaussian mRC in which multiple users exchange messages with the help of a 
relay terminal, R. Users cannot overhear each other's transmissions, hence the relay is essential 
for communication. We consider full-duplex communication, that is, all terminals including the 
relay can receive and transmit simultaneously. There are L > 1 clusters of nodes in the network, 
where each cluster has K > 2 users. Users in cluster j, j E Tl, are denoted by Tji, . . . ,Tjk 
(see Fig. [2]). Wji E Wji is the message of user Tji, and user wants to decode the messages 
(Wji, . . . , Wjx) for j E Zk,2 E Ik, the messages of all the users in its own cluster. We 
denote the set of users in cluster j by Tj, and the set of all users by T. 

The Gaussian mRC is modeled as 



L K 



and 



3=1 i=l 



Yji [t] = X r [t] + Zji [t] , j El L and i E X L 



(1) 



(2) 



where Xji[t) and Yji[t] are the input and the output at user Tji at time t, respectively, while 
X r [t] and Y r [t) are the input and output at the relay, respectively. Y r [t] is the received signal at 
the relay and Yji[t] is the received signal at user 7}». Z r is a zero-mean Gaussian noise at the 
relay with variance N r , i.e., Z r ~ Af(0,N r ), and Z^ is the Gaussian noise at user Tji, where 
Zji ~ J\f(Q,Nji) for j E T. L ,i E T K . All noise variables are independent of each other and 
the channel inputs and independent and identically distributed (i.i.d.) over time. Average power 
constraints apply on the transmitted signals at the relay and at the users Tji for all j E Tl and 
iEl L : 

1 



-E 



and 



n 



l -E 

n 



t=l 



< P r . 



(3) 



(4) 



Furthermore, although we have a full-duplex operation, the effect of the transmitted signal of 
each user on its received signal is ignored since it is known at the transmitter, and hence can be 
subtracted. 
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Fig. 2. The mRC model with L clusters, each of which is composed of K distinct terminals. All terminals in a cluster want 
to receive the messages of all the other terminals in the same cluster. The relay terminal facilitates the data exchange between 
the terminals. 



As in many of the previous work 01, 0, lfl2l . we consider "restricted encoders" at the 
terminals such that the encoders cannot use their received signals for encoding, and hence, 
their channel input depends only on their messages. Naturally, the achievable coding schemes 
proposed in this paper apply to the case without restricted encoders as well; however, our outer 
bound, and hence, the finite-bit gap arguments are only valid under this restriction. 

A (2 niJn , . . . , 2 nRlK , . . . , 2 nRL \ . . . , 2 nRLK , n) code for the mRC consists of N = LK sets 
of integers Wji = {1,2,..., 2 nRjl } for j £ Xl and i £ Xk as the message sets, iV encoding 
functions fa at the users such that 

a set of encoding functions {/*}™ =1 at the relay such that 

x r , t = fAY?- 1 ), l<t<n, 
and N decoding functions at the users: 

//,, : y;' x VV ; ; > i>V ; . VV ; „). 

such that 

(w; 1 ,...,w; K ) = 9jl (w JU Yji). 
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The average probability of error for this system is defined as 

K = Pr |J {//,v(U' ; ,V/;) / (ir (l W jK )}. 

jex L ,iex L 

Observe that the condition P e n — > implies that individual average error probabilities also go to 
zero. We assume that the messages Wji, j E II, i G are chosen independently and uniformly 
over the message sets VVji. We define the rate vector for cluster j as R J = (Rji, ■ ■ ■ , -Rjx) for 
j = 1, . . . , L and denote the overall rate vector as H L ' K = (R 1 , . . . , R L ). 

Definition 1: A rate tuple H L ' K is said to be achievable for an mRC with L clusters of K 
users each if there exists a sequence of 

ni?n nnR 1K nnR L1 <ynR LK „\ 

)•••)* )•••)* )•••)* 5 

codes such that — )■ as n — >■ oo. The corresponding capacity region is the convex closure 
of all achievable rate tuples. 

III. The Achievable Rate Region 

In this section we provide inner and outer bounds on the capacity region of the Gaussian 
mRC. The outer bound is based on the combination of the classical cut-set bound Il22ll and 
a genie-aided bound. The proposed achievable rate regions are based on the relaying schemes 
originally developed for the classical one-way relay channel. In particular, we consider AF, DF 
and CF schemes, and identify the corresponding achievable rates. Unlike the classical one-way 
relay channel, in the multi-way relay setting, the transmitters can exploit the knowledge of their 
own transmit signals to improve rate region. 

A. Outer bound 

To derive an outer bound on the capacity region of the mRC, we first consider the cut-set 
bound. Choose any proper subset of users from each cluster, and let Sj C X K denote the set 
of users chosen from cluster j. We will consider the information flow from these users to the 
remaining users. Consider the cut formed by [Jj =1 <Sj, which forms a MAC to the relay and 
provides the following cut-set bound on the total rate that needs to be transmitted over this cut: 

j=l ies 3 \ r J 
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for all Sj C X K . 

Next we consider a genie-aided outer bound. Choose one user from each cluster. Assume that 
a genie provides to the remaining users and the relay all the messages in the network. Hence, 
the network only needs to transmit the messages in their clusters to the set of chosen users. 
Since relay already knows all the messages, the setup boils down to a broadcast channel from 
the relay to the set of chosen users. Note here that, while the previous cut-set outer bound is 
valid for non-restricted encoders as well, this genie-aided outer bound is based on the restricted 
encoder assumption, and hence, ignores the potential feedback signal that can be transmitted 
from the receivers. 

The capacity region of a Gaussian broadcast channel with L receivers, power constraint P r at 
the transmitter, and noise variances Ni, . . . , N L at the receivers is given as ll22l 

C BC (P r ,N u ...,N L )±\(R u ...,R L ):0<R j <c( L "f* — ) , 

L } 

aj > for j G X L , a j = 1 f > (6) 

j=l J 

where t x — 1 if x is true and otherwise. Assuming that user T^. , lj e Xk, j G Xl is chosen in 
cluster j, the total rate of the messages to this receiver is Rj. = J2ies 3 -^j* where Wj = Xx\{lj}. 
These rates need to be supported by the broadcast channel, i.e. we need: 

{Rl 1 ,...,R{' I )eC BC (P r ,N ll ,...,N l .) (7) 

for all choices of Z 1; . . . , l L , where lj G X K and j G X L . The intersection of the bounds in © 
and © provides us an outer bound on the capacity region of the mRC. 

Note that this is a capacity region outer bound for the case of restricted encoders as we ignored 
the feedback to the encoders. 

B. Amplify -and-forward (AF) Relaying 

In AF relaying, the relay terminal amplifies its received signal within its power constraint 
and broadcasts to the receivers. However, since the signals from users that belong to different 
clusters act as noise to each other, we consider time-sharing among clusters, and apply the AF 
strategy separately for each cluster within its own timeslot. Let Tj denote the portion of the 
channel allocated to cluster j with J2j=i T j = 1- Within the timeslot of each cluster, all the 
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users in that cluster transmit, and the relay scales its received signal and broadcasts to the users. 
Within the timeslot for cluster j the relay's transmit signal is given by 



Pr 

X r = \l p l +N ^ + ---+ X jK + Z r ), 

where P^ is the transmit power of user 7}, and P/ is the transmit power of the relay at timeslot 
j. We have < P^ < and Y% =1 ^jP 3 r < p r- Each user can cancel out the effect of its 
own transmit signal, and decodes the messages of the other users in its own cluster. Since the 
transmission from a user acts as noise on the other users' transmissions, users do not necessarily 
transmit at full power. At each receiver, we have a Gaussian MAC with K — 1 users, and we 
assume Gaussian codebooks are used. 

Proposition 1: For a Gaussian mRC with L clusters of K users each, the rate region char- 
acterized by the union of the rate tuples satisfying the following inequalities is achievable with 
AF relaying and time-sharing between clusters 

o<E<^cf \ 

kes \ N r+ * =1 P ? NjJ 

for all j e X L , I E 1 K and S C l K \l such that < P' H < ^ for all j e X L , % e 1 K , 
E^=i T i p i- < p r, t 3 > for j = 1, . . . , L and Tj = 1. 

C. Decode-and-forward (DF) Relaying 

In DF relaying, the relay decodes messages from all the users, and broadcasts each message 
to all its recipients. DF consists of two transmission phases: the first phase is the MAC from the 
users to the relay, and the second phase is the broadcast channel from the relay to the users. Note 
that, due to the full-duplex nature of the relay operation, these two phases occur simultaneously 
for consecutive message blocks. The messages of all users can be decoded at the relay at the 
end of the multiple access phase if 



for all W 1 C X L , W 2 <ZX K . 

In the broadcast phase, we consider time-sharing among clusters, that is, the relay divides 
the channel block into L timeslots proportional to tj > where J2j=i r j = 1- F° r 3 e ^l, 
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the relay broadcasts the messages Wji, . . . , W jK to users Tji, ... , T jK within the j-th timeslot. 
For broadcasting within the j-th timeslot, rather than broadcasting each message one by one to 
its intended receivers, the relay broadcasts all the messages simultaneously to all the receivers 
by using the coding scheme introduced in which exploits the availability of the users' own 
messages in decoding the remaining messages. 

In [|2|, Tuncel has considered broadcasting a source to multiple receivers each of which has 
its own correlated side information, and characterized necessary and sufficient conditions for the 
reliable transmission of the source to all the receivers. In our setting, we consider Wji, W jK 
as the source message within timeslot j and Wji as the correlated side information at user Tji for 
j G Xl, i £ Xk- In this coding scheme, the relay generates a codebook of size 2 nR ^ x 2 nRj2 x 
... x 2 nR jK f or Qac \Y cluster j, consisting of r^n-length codewords i.i.d. Gaussian jV(0, P/), 
where J2j=i T jPr ^ Pr an d J2j=i T j = 1- F° r eacri message combination (Wji, . . . , Wj K ) the 
relay transmits the corresponding codeword over the channel. Each receiver finds the message 
indices by joint typicality using its channel output and its own message, which acts as the side 
information in our model. The analysis of the coding scheme follows from [2]. This coding 
scheme is also used in 11231 . [|24| and [|25l for identifying the capacity region of broadcast 
channels with message side information. We can show that the messages can be decoded by all 
the users if 



Pi 



iexjc\{i} v 3 J 

for all j e X L and I e X K . 

Proposition 2: For a Gaussian mRC with L clusters of K users each, the rate region char- 
acterized by the union of the rate tuples satisfying the following inequalities is achievable with 
DF relaying: 



2^ 



J2 R ° F ^ C ( N Pj % for all Wi C Xl, W% C Xk, (11) 



(12) 



and 



RW ^ T i c ijf) > for a11 3 ^ x L,ie x K , (13) 

i&T K \{l} ^ jl/> 

such that Y%=i T jPr < Pr, Tj > for j = I, . . . , L and Y%=i T o : = 1- 
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D. Compress-and-forward (CF) Relaying 

Next, we consider CF relaying which was introduced in flU for the 'one-way' single relay 
channel. In the CF scheme in a one-way relay channel, the relay transmits a quantized version 
of its received signal to the destination. Since the destination has its own received version of 
the source signal, which is correlated with the relay's received signal, the relay exploits this 
correlated side information at the receiver by using Wyner-Ziv compression E6l . Then the 
destination combines its received signal and the quantized version of the relay's received signal 
to decode the underlying source message. 

Note that in our mRC setup the users do not overhear each other's signals; however, they 
still have access to the side information correlated with the relay's signal: their own transmit 
signals. Therefore, we propose a transmission scheme for the multi-way relay channel based on 
CF relaying that exploits this side information at the users. 

In the CF scheme proposed in [21 for the TRC and extended in [fTTll to the mRC, the relay 
terminal quantizes its received signal and broadcasts this quantized channel output to the users. 
Hence, similarly to the DF scheme, the scenario is equivalent to broadcasting a common source 
to multiple receivers with correlated side information. However, note that, unlike the DF case, 
here we are interested in broadcasting a quantized version of the relay's received signal, rather 
than its lossless transmission. Yet it is possible to employ a coding scheme similar to the one 
used for DF relaying to exploit the side information at the users simultaneously. Improvement 
in the achievable rates is possible by employing layered digital codes as in or by further 
exploiting analog transmission as in ll27ll . However, this type of CF scheme, despite not using 
explicit binning, still requires decoding of the quantized relay signal at the users, which is not 
a requirement of the problem. Instead, the users can directly decode the messages of the other 
users without decoding the quantized relay signal. This coding scheme is originally considered 
in lfl2l for the TRC, and recently generalized to multiple relay networks in ll28l . A variation of 
the CF scheme in which the receivers decode only the bin indices rather than the compression 
indices before decoding the message indices is studied in ll29ll . 

We first provide an achievable rate region for a general discrete memory less channel in which 
the channel from the users to the relay is characterized by the conditional probability distribution 
p(y r \xi, . . . , xk) and the channel from the relay to the users is characterized by p(yi, . . . , yfc\xr)- 
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Note in this model that, the channel output at a terminal does not depend on the channel input of 
that terminal. This is in accordance with the Gaussian model, in which case the known channel 
input can be subtracted from the output of each user. We consider a single cluster to simplify 
the rate region expression and drop the cluster index in the random variables. Later, we use this 
expression to obtain an achievable rate region for the Gaussian model with multiple clusters. 

Theorem 1: For a discrete memory less mRC with K users exchanging information among 
each other, the rate tuples satisfying the following inequalities are achievable by CF if, 



Y,R°k F < mm I I(X(S);Y r \X(S c ),Q) 

fceS I 

for all S C X K , for some probability distribution in the form 



mm I(X r ;Y t \Q) - I(Y r ;Y r \X K ,Q) 



(14) 



p{q)p{x 1 \q) ■ ■ ■p(x K \q)p(x r \q)p(y r \x 1 , . . . , x K )p(y r \y r )p(y 1 , . . .,y K ). 

Proof: See Appendix lAl for the details. ■ 
Remark 1: In our achievable coding scheme, the users transmit a new message in every 
channel block and the relay quantizes and forwards its observation without Wyner-Ziv binning 
as in 0. Each destination decodes the messages of the remaining users by joint typicality after 
receiving the signal transmitted by the relay at each channel block. The users directly decode the 
message indices without trying to decode the quantized relay codeword first. While repetition 
coding and joint decoding is considered in ll28l . our result illustrates that this is not needed 
in the mRC network setup considered here. A similar result is recently obtained for a single 
source- single destination multiple relay network in ll30l . 

In the Gaussian setup, as in AF, we consider time-sharing among the user clusters in the 
multiple access phase as well as in the broadcast phase. This will prevent multiple user clusters 
from interfering with each other's signals, which would decrease the quality of the quantized 
signal broadcasted by the relay. 

Proposition 3: For a Gaussian mRC with L clusters of K users each, the rate tuples satisfying 
the following inequalities are achievable by CF: 



V^<r J -minjcfg^^Vc'f ^ )- g feU (15) 



for all j E X L and Sj C X K and some N 3 Q > 0, such that P- { < for all j E X L and i G X K , 
ELi T jP 3 r < Pr, T i > for j = 1, . . . , L and Y%=i r j = l - 
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Proof: We apply time- sharing between clusters, hence the achievable CF rates for each 
cluster is scaled with the portion of time tj allocated to that cluster. We also allow the relay 
to allocate its power among various clusters. We let the users generate Gaussian codebooks. 
In particular, we let Xji ~ A/"(0,PjJ and X r ~ Af(0,P r ), where P^ < Pji. Without claiming 
optimality, we also let the quantization noise at the relay for quantizing the received signal Yj 
for cluster j be Gaussian, that is, 

Y r j =Y r j + Qi, (16) 

where Qj is a Gaussian random variable with Qj ~ A/"(0, Nq), Nq > 0, and independent of Yj. 
Calculating the mutual information expressions for these Gaussian random variables results in 
the above rate region. ■ 

E. Lattice Coding 

In the previous sections, we have concentrated on various random coding schemes for com- 
munication over the mRC. Recently, it has been shown in ifTTI . J6]|, OTTl and 11321 . that nested 
lattice codes can be effective in achieving higher rates in some Gaussian networks by exploiting 
the topology of the network. Basic motivation in employing lattice codes in these architectures 
is to allow the relay nodes to decode only the modulo sum of the messages rather than decoding 
the individual messages. 

Unfortunately this structured coding scheme does not directly scale with increasing number 
of simultaneously transmitting users at each instant, that is, by knowing the modulo sum of 
more than two messages and only one of the messages, the users cannot decode the remaining 
messages. Hence, in our setup, we concentrate on the use of lattice codes for the case with K = 2. 
This model is equivalent to having multiple two-way relay channels served simultaneously by a 
single relay terminal EOl . lfT9ll . We term this model the mRC with pairwise data exchange. 

In this section, we provide an achievability scheme based on nested lattice codes [|33l . in 
which each user in the same pair uses a lattice structure to transmit its messages so that the 
addition of any two message points is also a member of the lattice. The relay terminal decodes 
the modulo sum of the transmitted lattice points, and then broadcasts this modulo sum to both 
users, each of which can decode the other user's message by subtracting its own message. 
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We next provide a brief review of nested lattice coding that will be required for the presentation 
of the coding scheme (see ll33ll or ll34ll for further details). An n-dimensional lattice A is defined 

as 

A = {GX : X G Z n }, 

where G G R n is the generator matrix. For any x G R n , the quantization of X maps X to the 
nearest lattice point in Euclidean distance: 

Q A (X) ^argmm||X-Q||. 

The mod operation is defined as 

X mod A = X -Q A (X). 

The fundamental Voronoi region V(A) is defined as V(A) = {X : Q\(X) = 0}, whose volume 
is denoted by V(A) and is defined as V(A) = f v , A ^ dX. The second moment of a lattice A is 
given by 

^ 2 (A) = tTTaT / W x W 2dx > 
nV{A) J V(A) 

while the normalized second moment is defined as 

a 2 (A) 



G(A) 



V(A) 2 / n ' 

We use a nested lattice structure as in 0, where A c denotes the coarse lattice and A/ denotes 
the fine lattice and we have A c C A^. All transmitters use the same coarse and fine lattices for 
coding. We consider lattices such that G(A C ) w j— and G(Af) w j—, whose existence is shown 
in [[H. hi nested lattice coding, the codewords are the lattice points of the fine lattice that are 
in the fundamental Voronoi region of the coarse lattice. Moreover, we choose the coarse lattice 
(i.e., the shaping lattice) such that cr 2 (A c ) = P to satisfy the power constraint. The fine lattice 
is chosen to be good for channel coding, i.e., it achieves the Poltyrev exponent O. 

We assume that both of the users Tji and T j2 in pair j use the same nested lattice structure for 
coding, and hence, achieve the same rate R l ^ tUce _ We also assume that both users have the same 
power constraint Pj, as additional power at one of the users would be useless in the proposed 
scheme. However, we want to note here that it is also possible to combine this lattice coding 
scheme with a random coding scheme as in ll35l such that the user with more power available 
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can superimpose an additional random code on top of the lattice code, and hence, achieve a 
higher data rate. 

We use time division among the user pairs in the transmission to the relay terminal. Each 
transmitter maps its message Wji to a fine lattice point Vji E Ay flV(A c ), j G T L and i = 1, 2. 
Each user employs a dither vector Uji which is independent of the dither vectors of the other 
users and of the messages and is uniformly distributed over V(A C ). We assume all the terminals 
in the network know the dither vectors. The transmitted codeword from transmitter Tji is given 
by 

Xji = {Vji - Uji) mod A c . 

It can be shown that Xji is also uniform over V(A C ). 

The relay decodes the modulo sums of the messages, Vj = (Vji + Vjz) mod A c , instead of 
decoding individual messages. Due to the group structure of the lattice, Vj also belongs to the 
fine lattice. Moreover, it is possible to show that Vj is also uniformly distributed over the fine 
lattice points within the Voronoi region of the coarse lattice, i.e., over the set Af n V(A C ). 

Following ll33l and ifTTfl. it is possible to show that there exist nested lattices at rates arbitrarily 
close to 

/ P 1 

T)lattice /"H" I J 

where J2j=i T j = 1 anc ^ C + (x) — C(x) if x > 1 and otherwise. This allows the relay to 
decode V,'s with vanishing error probability. 

For the broadcasting of the modulo sums from the relay to the pairs, the rate is bounded by the 
rate that can be transmitted to each user, i.e., we need (^ / 1 attlce ; . . . ; Riattice^ e C BC (P r , Ni, . . . , N L ), 
where Nj = max{A^i, iV^}. 

IV. Exchange rate for a Symmetric Network 

In this section, we focus on a symmetric network with equal power constraints at the users, 
Pji = P, and compare the achievable equal rate points with the proposed relaying schemes, 
i.e., Rji = R for all j e II and i & Z/> Exchange rate analysis will allow us to compare 
these schemes analytically for different numbers of clusters and users and with different power 
constraints. We say that a total exchange rate of R t is achievable for a system with L clusters 
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and K users in each cluster if ■ ■ ■ , jfe) is an achievable rate tuple. The exchange capacity 
is defined as the supremum of all achievable total exchange rates, i.e., 

C£* = swp{LKR : (R, . . . , R) is achievable}. 

We find lower and upper bounds on the exchange capacity of the network. In general, these 
bounds do not match and the exchange capacity of the mRC remains open. However, we show 
below that the gap between these two is less than a finite number of bits which is independent 
of the power constraints of the users. In the analysis of the exchange capacity, to simplify the 
notation and to focus on the fundamental behavior of the analyzed schemes, we consider a 
symmetric network with P^ = P and iV r = Nji = 1 for all j EX L , i EX L . 

We start with the upper bound on the exchange capacity. For a symmetric Gaussian mRC 
with L clusters of K users each, the exchange capacity is upper bounded by 

R L ub =j^mm{C(L(K-l)P),C(P r )}. (18) 

With AF relaying, the achievable total exchange rate is found as follows from Proposition Q] 
by letting Tj = 1/L, P* = P r and = LP: 

r l,k_ K ( L(K-l)PP r \ 

Raf ~ (K^T)° [l + LKP + P r ) ■ (19) 



In a symmetric Gaussian mRC with L clusters of K users each, the following exchange rate 
is achievable with DF relaying by letting Tj = 1/L and P-l = P r : 

Rt£ = mm ^C(LKP), -J^C(P r )} . (20) 



Remark 2: Comparing (1201) and (fT8l) . we can show that DF achieves the exchange capacity 
when 

P r < (l + LKPf-T< - 1. 

This corresponds to the case in which the relay power is the bottleneck, i.e., the exchange 
capacity is limited by the rate at which the relay can broadcast to the users. The range of P r 
for which DF is optimal increases as the number of clusters, the number of users within each 
cluster, or the power constraint P of the users increases. 

Finally, the total exchange rate achievable by CF over a symmetric network is given by 



CF K-l \l + L{K-l)P + P r ' 
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Remark 3: Comparing (fT9l) and (I2TT) . we observe that, for an arbitrary number of clusters and 
terminals within each cluster (L > 1, K > 2), the total exchange rate achieved by AF is lower 
than CF. Yet, we remark that the simplicity of AF relaying compared to CF may be attractive 
in practice. Moreover, the gap between the two is upper bounded: 

K , , K s 



T>L,K _ r>L,K ^ 
^CF ^AF 



log( 



(22) 



2(K- 1) '~^K- r 
which is independent of the power constraints and the number of clusters. 

In the next theorem, we prove that the CF protocol achieves rates within a constant number 
of bits of the exchange capacity for an arbitrary number of clusters and users independent of 
the available power at the users and the relay. 

Theorem 2: For a symmetric Gaussian mRC with L clusters of K users each, the CF protocol 
achieves rates within 2 (k-i) bits °f the exchange capacity. 

Proof: First, assume that P r > L(K—l)P. Then we have the following chain of inequalities: 
^ K K „( L(K-l)PP r 



R, 



CF 



-C 
K-l 

K 

""2(a:-i; 

K 

~2(#-i; 



l + L(K-l)P + P r , 

L(K - l)PP r 



log 1 + 



> R L > K 



l + L(K-l)P + P r 

log(l + L(K - 1)P) + log 

K , f l + P 
log 



1 + L(K — \)P + P r 



2(K-r 

K 



l + 2P r 



where (T26l) follows from the assumption that P r > L(K — 1)P. 
Next, assuming P r < L(K — 1)P, we have 



(23) 
(24) 
(25) 
(26) 
(27) 



CF 



2{K-1) 



> R UB + 



log(l + P r ) + log 



>R 



2{K-l 
K 



log 



1 + L(K- l)P 
1 + L(K- 1)P + P r 
1 + L(K - 1)P 



(28) 



l + 2L(K-l)P 



UB 



2(K-iy 



(29) 



Remark 4: It is noteworthy that the constant gap to the capacity is only a function of K, and 
is independent of the number of clusters and the power constraints of the users and the relay. We 
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can conclude that CF is nearly optimal in the high power regime for which the finite bit gap to 
the capacity becomes negligible. Note that this finite bit gap is bounded by one bit independently 
of the number of users within each cluster K and decays to half a bit as K increases. 

Remark 5: A direct consequence of Remark [3] and Theorem [2] is that the AF protocol achieves 
total exchange rates within ^(ir-if^ mts °^ me excnan g e capacity. We can further bound this 
gap from above by 1 + log K, which scales with an increasing number of users within each 
cluster. 

V. Special Networks 
A. The Multi-way Relay Channel with Full Data Exchange 

In this section we consider a special mRC with a single cluster L — 1, that is, each user wants 
to decode all the messages in the system. We term this model the mRC with full data exchange. 
A similar model, the multiway channel, in which there is no relay terminal, and the users can 
receive each other's signals is considered in [8]. 

Let us assume that the relay's power scales with the number of users, i.e., P r = KP. In this 
case we 

have R ]f B = K-c((K-i)P) and r x,k = c(Kpy We 

can see from these expressions 

that, with increasing power, the gap between the two increases and can be arbitrarily large when 
P is very high. In Fig. [31 we plot the upper bound and the achievable exchange rates for this 
setup. We see from the plot that the gap between the upper bound and the achievable exchange 
rate with DF diverges quickly with increasing power especially in the case of small numbers 
of users. The total exchange rate decreases with the increasing number of users in the system. 
We have a finite gap between the achievable rate of the CF scheme and the upper bound at all 
power values. A similar finite bit gap is also observed between the CF and AF schemes as was 
shown analytically. Especially for a small number of users, the rate of CF dominates the rate 
of DF for a wide range of power values. On the other hand, DF achieves higher exchange rates 
than CF in the low power regime. The range of power values in which the DF dominates CF 
gets larger with the number of users in the system. This is due to the fact that CF forwards 
more noise when there is increased interference. Similar observations can also be made when 
the relay power does not scale with the number of users, i.e., P r = P, which is illustrated in 

Fig. a 
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Fig. 3. Total exchange rate versus the user power, P. The relay power is equal to the total user power, i.e., P r — KP. We 
illustrate rates for K — 2 (the lines with the marker), and K — 20 users. 



In Fig. [51 we plot the upper bound and the achievable total exchange rate versus the number 
of users for the mRC with full data exchange. The lines marked with a circle correspond to the 
case where the relay power scales with the number of users as P r = KP, while the unmarked 
lines correspond to the case where the relay power is fixed as P r = P. From Theorem |2l the gap 
between the upper bound and the achievable total exchange rate with CF for L = 1 is 2 (k-i) • 
This gap approaches 0.5 bits as the number of users K increases independently of the power 
constraints. We can see that the gap is much smaller when the relay power is equal to the power 
constraint of each user. With the number of users increasing, both DF and CF get very close to 
the upper bound. The DF scheme achieves the upper bound with a smaller number of users in 
the system for the power constraint considered in this figure. In both cases, the achievable rate 
of the AF scheme is very close to the one achieved by the CF scheme. The gap between the 
two decreases with the number of users in the system. Note also the initial behavior of the total 
exchange rate with increasing number of users for the case P r = KP. The rate falls sharply due 
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L = 1, P = P 

r 




P (dB) 

Fig. 4. Total exchange rate versus the user power with P r = P. We illustrate rates for K — 2 (the lines with the marker), 
K = 4 and K = 8 users. 

to the interference introduced by the new users. However, the effect of the interference saturates 
after a certain number of users, and the total exchange rate starts increasing again. We can prove 
analytically that the exchange capacity goes to infinity as the number of users goes to infinity if 
the relay power is scaled with the number of users, whereas it saturates when the relay power 
is kept constant. 

B. The multi-way Relay Channel with Pairwise Data Exchange 

In the previous subsection we focused on full-data exchange, in which case each user wants to 
decode the messages of all other users. This constitutes one extreme in the mRC model. Another 
extreme would be to assume that users are paired, and each user is interested only in the data 
of its partner, i.e., L > 1 and K = 2. For the pairwise data exchange model in addition to the 
random coding schemes, we also have a nested lattice coding scheme provided in Section IIII-EI 

For the lattice coding scheme in a symmetric network with L > 1 clusters, we use time-sharing 
among the clusters for both the lattice coded multiple access and the broadcast phases. Each 
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Fig. 5. Achievable total exchange rate versus the number of users with P — 30 dB. The lines with the marker correspond to 
the case with P r — KP, while the nonmarked curves correspond to the case with P r — P. 



jfiL = min < ™ i"'~ r ^* [> . ( 30) 



pair will transmit 1/L portion of the timeslot using the same nested lattice code while scaling 
their power level accordingly. Then the relay broadcasts each pair's modulo sum to both users 
over 1/L portion of the timeslot. 

For the broadcasting of the modulo sums from the relay to the pairs, the rate is bounded by 
jC(P r ). Hence, the following exchange rate can be achieved by nested lattice codes: 

' max{0, C (LP - ±)} C(P r ) 
~L ' L 

Remark 6: We can see from ([301) that lattice coding achieves the exchange capacity if < 
P r < LP — |. In general, assuming that LP > 1/2, the total exchange rate achievable by lattice 
coding is within bits of the total exchange capacity and this gap decays to as LP goes 
to infinity, i.e., lattice coding achieves exchange capacity if either the number of users or the 
power constraint of the users goes to infinity. 

In Fig. [6l we illustrate the upper bound and the achievable total exchange rates for the pairwise 
data exchange model with L = 8 pairs as functions of P, while P r = 2LP. Similar observations 
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P = 2LP, K=2, L=8 

r ' ' 




User Power (dB) 



Fig. 6. Exchange capacity upper bound and achievable rates versus power P for the pairwise data exchange model. 



as in Section IV-AI apply for DF, CF and AF schemes. As the power constraint P increases, 
lattice coding quickly outperforms other schemes and gets very close to the upper bound. In 
Fig. [7] we plot the total exchange rates with respect to the number of pairs in the system for 
P = — 5 dB and P r = 2LP. We can see that, similar to the behavior seen in Fig. [61 lattice 
coding improves as the number of pairs increases and gets very close to the upper bound, while 
CF and AF follow the upper bound within a finite bit gap uniform over the power constraints. 

VI. Conclusion 

We have considered the Gaussian multi-way relay channel in which multiple clusters of 
users communicate simultaneously over a single relay terminal (no cross-reception between 
the users), and the users in each cluster want to exchange information among themselves. We 
have characterized the achievable rate region with AF, DF and CF schemes. When each cluster is 
composed of two users, we have characterized the rate region achievable by nested lattice coding 
as well. Specializing our results to the case of exchange rate points over symmetric networks, 
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P = -5dB, P = 2LP, K=2 
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Upper bound 
DF 




1 23456789 10 

Number of Pairs, L 



Fig. 7. Exchange capacity upper bound and achievable rates versus the number of pairs in the system for the pairwise data 
exchange model. 

we have shown that the CF scheme achieves exchange rates within a constant bit offset from 
the exchange capacity, while this constant gap is independent of the number of clusters and the 
power constraints of the nodes. The gap between the total exchange rates achieved by CF and 
AF schemes is also shown to be below a certain finite number of bits. Finally, we have shown 
that the nested lattice codes achieve rates within a finite bit gap of the exchange capacity for 
the case of multiple clusters with two users each, and that lattice coding outperforms all other 
schemes in this setup. 

These results point to the fact that the additional decoding requirement at the relay node, 
imposed in the case of DF relaying, might be limiting in terms of the achievable exchange 
rates, and relaxing this requirement might lead to rates that are very close to the capacity in 
certain scenarios. While the decoding requirement is completely removed in the case of AF and 
CF relaying, it is relaxed in the case of lattice coding. It is an interesting research direction 
to explore other decoding requirements at the relay terminal with structured codes, that will 
be helpful in the case of clusters with multiple users. Our results provide design insights, in 
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Fig. 8. Illustration of the CF coding scheme. 

particular towards the relaying techniques to be employed and how close their performance is 
to the ultimate capacity bounds for this multi-way cooperative communication model. 

Appendix A 
Proof of Theorem [A] 

For simplicity of notation, we consider the case with a constant Q. The achievability for 
general time-sharing random variable can be obtained by using the classical arguments Il22ll . 

A block Markov encoding structure is considered, in which the messages are coded into B 
blocks, and are transmitted over B+l channel blocks. The relay forwards the information relating 
to each message block over the next channel block. The relay is kept silent in the first channel 
block, while the transmitters are silent in the last one. The receivers decode the messages from 
the relay's transmission right after each block. Since there is no coherent combining, transmitters 
send only new messages over each channel block, and thus sequential decoding over each block 
is sufficient. 

Codebook generation: Fix p{x\) ■ ■ ■ p(xK)p{x r )p(y r \yr) ■ The random codebook at user i is 
generated i.i.d. from the distribution YYj=iP( x i,j) f° r eacn message Wi G [l,2 nfli ]. We also 
generate 2 nR Q quantization codewords i.i.d. according to YYj=iP(yr,i), an d for each of these we 
generate one relay codeword i.i.d. with YTj=i P{ x r,j)- We enumerate these codewords as i)r(w) 
and x™(w), respectively, for w G {1, . . . , 2 nR Q}. 

Encoding: See Figure [8] for an illustration of the encoding scheme over the channel blocks. 
Transmitter Tj transmits the codeword x^{wi,b) at channel block b = 1, . . . , B. All users transmit 
the codewords corresponding to message index 1 at the last channel block. The relay, upon 
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receiving y^{b), looks for an index s& such that the corresponding codeword Y™(sb) is jointly 



typical with y^{b), i.e., (i/~(&), Y n (s&)) G T" ^, 111 If no, or more than one such s b is found, it 



Q. If no, 

sets St = 1. Then the relay transmits X"(s 6 ) in the channel block 6+1. 

Decoding: Upon receiving y™(b), the user i looks for the set of messages indices 

% % ^ i ^ i \ 

w l,b-li • • • > w i-l,b-l) w i,b-li w i+l,b-li ■ ■ ■ i w K,b-l) 

such that there exists an index s G [1, 2 nRQ ] for which 

wct&i^o, • • • ,^r-i(^-i,6-i)^rK6-i),^r + i(^i,6-i), • • • ,^(t&W)^( a )) G ^ Xk %. 

and 

are simultaneously satisfied. If no, or more than one such set of message indices are found, then 
we set w{ >b _ x = ■■■ = ^_i )6 _i = i^+i,&-i = • • • = w^s-i = 1. 

Error Analysis: Note that the error probability at each channel block is independent from 
the others. Hence, we will consider the error probability for each channel block separately as 
the total error probability will be bounded by the sum. For simplicity we will drop the channel 
block indices in the variables. Let Wi, . . . , Wk denote the messages of the users, and S denote 
the quantization index chosen by the relay. 

We define the following error events. 

ex = {(l/?, Y r n (s)) i T { y rh for any s G [1,2"*)} , 

and 

£2(101, . . . , Wi-i, Wi+i, w K ) = 

{(X?( Wl ), . . .^Xl^w^xUw^X^w^), . . .,X K (w K ),Y r n (s)) G T£ Xi Xk%w and 

(X?(s),y?) G T^ XrYils for some s G [1,2^]}, (31) 

Assuming, without loss of generality, that W\ = ■ ■ ■ = Wk = 1 and 5 = 1, the error 
probability can be upper bounded by 

P( £l ) + P(uf:^ n 4' c (i)) + P(uf =1 u w/1 4' c (w)), 

'The set of 5- typical n-tuples according to Px is denoted by Tnq . 



DRAFT 



27 



where e c is the complement of the event e, w G Z K 1 with Wj G [1, 2" Rl ] and 1 is the length- 
(lf — 1) vector of Is. We can further upper bound this by 

K K 

p( ei ) + £ P(e? n 4 c (i)) + £ P(u w ^4 c (w)). 

i=l i=l 

Note that, as n ->■ oo, P(V,) -> if R Q = I(Y r ; Y r ) + e, and n 4' c (l, . . . , 1)) ->■ from 
the properties of the typical sets Il22ll . 

We can bound the last error term as follows. 

P(Uw#i4' c (w)) = £ ]T P(U w ^4 c (w| S = l)) + P(U w ^4' c (w| S ^l)) (32) 

Scljf\{i} w:uy=:U^;eS 
5cXif\{i} 

+ 2 n-R(S) . 2«^Q . 2~ n ( I ( xK l^)-e') . 2-n(/(X r ;yi)-e") (33) 

These two terms go to as n — >■ oo if 

i?(<S) < min{/(X(«S); Y r \X(S c )), I(X K ; Y r ) + I(X r ; Y t ) - I(Y r ; Y r )} - e (34) 

= min{/(X(<S); F r |X(5 c )), I(X r ; Y t ) - J(Y r ; Y r \X K )} - e, (35) 

for appropriately chosen positive e, e', e" and e. 

Hence, the rates should satisfy the following set of inequalities 

£ ^ F < mmmin{/(X(<S); Y r \X(S c ), Q), I(X r ; Y t \Q) - I(Y r ; Y r \X K , Q)}, (36) 
fees 

for all S C T K , which is equivalent to the form given in the theorem. 
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